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Abstract 

Prior research demonstrates that examiner unfamiliarity 
negatively affects the optimal performance of handicapped 
preschoolers. The present investigation sought to determine whether 
examiner unfamiliarity also interferes with the optimal performance of 
handicapped school-age pupil's and nonhandicapped children. Sixty-four 
SJb.jects (16 language-handicapped and 16 nonhandicapped preschoolers 
and 16 language-handicapped and 16 nonhandicapped school-age students) 
were tested twice during a period of 2 weeks, once by a familiar 
examiner and once by an unfamiliar examiner, within a crossover 
design. A significant interaction was obtained for examiner 
familiarity and handicapped^ status, indicating that whereas 
nonhandicapped subjects scored similarly when tested by familiar and 
unfamiliar examiners, handicapped children scored higher with the 
familiar tester. Thus, findings indicated that examiner unfamiliarity 
negatively affects both language-handicapped preschool and school -age 
children's performance relative to a normative population; it appears 
to depress selectively handicapped children's test performance, 
thereby indicating that an examiner's unfamiliarity constitutes a 
negatively, systematically biasing condition and threatens the 
validity of handicapped students' test performance. 



Systematic Bias in the Assessment of Handicapped Children 

There is general and chronic disregard for the importance of 
situational variables in the assessment of children. This is true 
whether the purpose of testing is for educational programming (Labov, 
1973; Mehan, 1978;; Sigel, 1975) or research (Bronf enbrenner , 1979; 
Cazden, 1973; Cole & Bruner, 1972; Mishler, 1979). Nevertheless, a 
limited / but growing body of empirical evidence demonstrates that 
systematic variation of certain typical features of the assessment 
situation predictably affects test performance (Sattler, 1974). 

The effect of the situational factor "examiner familiarity" has 
been explored relatively frequently. Interest in this factor often 
has been based on the long-standing developmental notion that children 
derive much of their comprehension and feelings about a situation from 
significant adults in that situation (cf. Freud, 192.1-22; Piaget, 
1965). A recent literature review (Fuchs & Fuchs, 1983) identified 20 
investigations of examiner familiarity effects. Twelve of these 
studies reported subjects performing significantly better under the 
familiar condition. Of this group, 10 investigations involved 
handicapped, minority, and/or preschool subjects, thereby suggesting 
the relative importance of examiner familiarity to children with one 
or more of these characteristics. Handicapped preschoolers 1 test 
performance, appeared to increase most consistently in the familiar 
examiner condition. Among six pertinent studies using handicapped 
preschoolers as subjects, five (92%) investigations reported positive, 
significant results. 

By demonstrating th?t handicapped preschoolers, and other select 
groups, perform differently with familiar and unfamiliar examiners, 

e . 



recent research has identified a source of error in the manner in 
which tests commonly are administered. Findings indicate that the 
typical testing procedure discourages handicapped preschoolers 1 
optimal, absolute performance. A related and more important issue is 
whether examiner unfamil iarity negatively influences these children's 
performance relative to some normative populations; i.e., whether the 
strangeness of a tester selectively depresses the performance of 
handicapped preschool children. If so, a tester's unfamiliarity would 
be biased systematically against, and would threaten the validity of, 
handicapped preschool children's performance. * The present study 
explored this possibility by employing both handicapped and 
nonhandicapped preschool children. 

This study also sought to extend previous research along a 
second, heretofore unexplored dimension: namely, whether school-age 
handicapped children, as well as handicapped preschoolers, demonstrate 
dif f erenti al performance in ^avor of the f ami 1 i ar tester . By 
examining preschool and school-age children with a similar 
handicapping condition, this study explored whether handicapped 
preschoolers' sub-optimal performance with strange testers is, for the 
most part, a developmental issue, or whether it is more closely 
related to the fact that they are handicapped, Nonhandicapped school- 
age children also were incorporated into the design to address the 
possibility of a systematically biasing effect of examiner 
unfamiliarity at the school-age level, as well as among preschoolers. 
Finally, in an effort to shed light on the possibly mediating roles of 
CA and handicapping conditions in examiner familiarity/unfamil iarity 
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effects, testers 1 ratings of children's test-related behaviors were 
collected and analyzed. 



A total of 64 subjects comprised four different groups of equal 
size: 16 handicapped preschoolers (HP), 16 handicapped school-a^e 
children (HS) , 16 nonhandi capped preschoolers (NP) , and 16 
nonhandi capped school -age children (NS) . The HPs and HSs were 
moderately to profoundly speech- and/or language-impaired, performed 
within the normal range on individually administered 10 tests, and 
were participating in one of two language programs (preschool or 
elementary school levels) that were administered by the same public 
educational collaborative. The admission criteria of these special 
education programs included performance on speech and/or language 
measures that was at least lh standard deviation? below the mean. NPs 
and NSs were drawn from a large college-affiliated nursery school and 
public elementary school, respectively. All subjects were Caucasian, 
English-speaking, and from predominantly middle-class families located 
in five contiguous towns in Central Massachusetts. The mean CA for 
handicapped and nonhandicapped subjects was 77.44 (SD = 24.91) and 
76.91 (SD = 24.49) months, resDecti vely; average CA for preschool and 
school-age subjects was 57.16 (SD = 7.20) and 97.19 (SD = 18.48) 
months , respectively. A two-way analysis of variance (ANOVA; 
Handicapped vs. Nonhandicapped and Preschool vs. School-age) revealed 
a significant difference between the CAs of preschool and school-age 
subjects, £(1,60) = 128.28, £ < .001; there was no significant 



Method 



Subjects 




disparity in CA between handicapped and nonhandicapped children, 
F(l,60) = .02, ns, and there was no significant interaction, F(l,60) = 
.00, its. Additionally,. identical numbers of male (N = 20) and female 
(N = 12) subjects constituted the handicapped, nonhandicapped, 
preschool, and school-age groups. 
Examiners 

There were 32 examiners. All were Caucasian, female, and trained 
as speech clinicians at a state college in Central Massachusetts. 
Sixteen examiners were seniors who were one month away from graduation 
and who had completed a year-long practicum in clinical or educational 
settings. The remaining 16 testers were enrolled in a Masters program 
in Communication Disorders and were practicing speech clinicians. 
Table 1 is a summary of the examiners in terms of whether they tested 

handicapped, nonhandicapped, preschool, or school-age subjects, and 

I 

reports means and standard deviations for the four groups 1 CA and 
professional experience at the preschool and school-age levels. Two- 
way ANOVAs (Handicapped vs. Nonhandicapped and Preschool vs. 
School -age) revealed no significant differences between the examiner 
groups on any of these variables. (See Table 1 for £ and £ values,) 
Additionally, a multivariate two-way analysis of variance indicated 
there were no differences between the examiner groups in terms of 
their attitudes toward and conceptualization of handicapped vs, 
nonhandicapped adults or children as measured on the Role Category 
Questionnaire (Crockett, 1965; see Fuchs, Fuchs, Dailey, & Power, 
1983, for details on administration and scoring). 
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Insert Table 1 about here 



Design 



Each examiner was matched randomly with one of the four study 
sites, resulting in the combination of four seniors and four 
professionally experienced speech clinicians per subject group. Each 
of eight examiners per site was assigned randomly to four subjects, 
two with whom they became personally familiar and two to whom they 
remained strangers. In this way, examiners served in both familiar 
and unfamiliar roles, thereby controlling for potentially confounding 
effects of testers' personality. 

Wi th ever j exanii ner testing four ch i 1 dren , each of the 16 
subjects was assessed twice during a period of two weeks, once by a 
familiar examiner and once by the unfamiliar examiner within a 
crossover design. ~ Examiners assessed one familiar and one unfamiliar 
subject on one occasion and another familiar /unfamiliar pair of 
subjects on a second day. On their first day of testing, one-half of 
the examiners first tested familiar children, then unfamiliar 
children; the remaining examiners tested their examinees in reverse 
order. On the second day, examiners who previously had tested 
initially familiar subjects were required to test unfamiliar children 
first; those examiners who, on their first day of testing, had 
assessed unfamiliar children first were required on their second day 
to test them after familiar examinees. The time of day that subjects 
were assessed also was controlled; if a child was tested by the 
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familiar examiner at 9:00 a.m., this child also was examined at 9:00 
a.m. by the unfamiliar tester. Finally, testing at each site was 
conducted in a quiet setting that was familiar to the children. 
Procedure 

Selection of nonhandicapped subjects . NPs and NSs were selected 
by a stratified random sampling procedure. Specifically, for every 
handicapped subject, all nonhandicapped children in the nursery and 
elementary schools were identified (a) whose CAs were either three 
months below or above the CA of the handicapped subject, (b) who were 
of the samejgender, and (c) who were not receiving special education 
services. This generated 32 overlapping groups (16 at preschool level 
and 16 at school -age level) in which membership ranged, from 4 to 9. 
From each of these groups, two nonhandicapped children were drawn 
randomly, one designated arbitrarily to participate in the study and 
the other assigned to a replacement group. 

Selection of examiners . Examiners were chosen in a four-stage 
process. First, an initial pool of 95 prospective examiners was 
established by identifying those graduate students (N = 52) and 
seniors (N - 43) who matriculated in the Communication Disorders 
Department. Second, the records of these students were examined to 
determine whether the prospective examiners met three criteria; 
namely, examiners were required to (a) be female; (b) have a minimum 
of H year experience with preschool children either as a professional 
speech clinician or as a student-in-training; and (c) have completed 
successfully a year-long practicum (for seniors) or an advanced 
diagnostics course (for graduate students) . Application of these 
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criteria reduced the initial pool to 72 students, 35 graduate students 
and 37 seniors. Third, 22 individuals were chosen randomly from each 
of these groups, with 16 in each group arbitrarily designated to 
participate as examiners and 6 1n each group assigned to a replacement 
group. Finally, the 32 prospective examiners were sent letters 
informing them (a) of an opportunity to participate in an 
, investigation whose "aim is to study the validity of using 

standardized tests to assess the language performance of various 
groups of children, " (b) of the nature and extent of their 
responsibil ity as study participants, and (c) that, in return for 
their participation, they would be awarded stipends and an opportunity 
to earn academic credit. 

Personal famil iari ty . Examiners 1 personal famil iarity was 
induced experimentally by two procedures. First, examiners who were 
assigned to preschool subjects were required to make one home visit to 
each of their two ■■familiar" children. Testers were told that the 
purpose of these visits was "to get to know the child and to permit 
the child to get to know you." Examiners al so were instructed to 
bring toys and games with which to play with the children. The home 
visit was to last about one hour. If subjects were sciool-age, 
examiners spoke with their future examinees by telephone instead of 
visiting them in their homes. Examiners were asked to (a) introduce 
themselves, (b) discuss briefly the upcoming testing, and (c) inform 
the children that, prior to testing, there would be an opportunity to 
pi ay. Testers and chi Idren explored together materials that they 
could bring to this interaction, as well as special activities they 

\ 
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might pursue. Subjects were made to expect the telephone calls via 

\ 

their classroom teachers, and the phone conversations were to last 
about five minutes. 

The study's second strategy to induce personal familiarity 
required testers to play with their familiar children on a one-to-one 
basis in a quiet space outside of the subjects classroom. Each tester 
of preschool children provided the same toys and games with which she 
and the child had played during the home visit; examiners of school- 
age subjects attempted to bring mater i als and/or conduct activities 
that h^d been mentioned during their phone conversations. The play 
sessions always immediately preceded testing and lasted 20 and 30 
minutes for preschool and school-age sqbjects, respectively. Research 
assistants monitored and occasionally t regulated the duration of these 
play sessions. The lapse in time separating the home visit or phone 
conversation and testing ranged from two to five days. 

Following the data collection phase of this investigation, 
examiners were asked to complete a questionnaire that was administered 
during a debriefing session. Their responses indicated that the 
average home visit (to preschoolers) lasted 60.63 minutes (SD = 20.70) 
and the average phone conversation (with school-age children) lasted 
5.81 minutes (SD ? 4.69). In combination with time spent during the 
play sessions immediately preceding testing, preschool and school-age 
subjects spent averages of. 80. 63 minutes (SD = 20.70) and 34.72 
minutes (SD = 7.50), respectively, with their familiar examiners prior 
to testing. Handicapped children's total interaction- time with 
'familiar testers averaged 56.144 minutes ( SD^^25-.^7 ) ; nonhandicapped 



subjects averaged 58.91 minutes (SD = 29.92) of pretest contact with 
familiar examiners. A two-way ANOVA (Hanciapped vs. Nonhandicapped 
and Preschool vs. School-age) indicated that, whereas preschool 
subjects " participated in significantly greater amounts of pretest 
contact with examiners than did school-age subjects, £(1,60) = 137.57, 
£ < .001, handicapped and nonhandicapped subjects did not differ in 
this respect, £(1,60) = .40, njs. 
Measures 

The Clinical Evaluation of Language Functions (CELF; Semel-Mintz 
& Wiig, 1982) is a comprehensive language test that is comprised, of 
two scales, processing (PS) (i.e., auditory comprehension) and 
production (PC) (i.e., 1 verbal expression). Semel-Mintz and Wiig 
(1982) reported internal consistency of .82 and a test-retest 
reliability coefficient of .96. Pilot administrations, of the CELF 
were conducted with language-impaired preschoolers (CAs: 3-10 to 4-8) 
and nonhandicapped, above-average achieving intermediate grade 
children (CAs: 9-4 to 12-1) to determine the possibility of "floor" 
and "ceiling" effects, respectively. Results indicated the CELF was 
appropriately difficult for the study's diverse sample. Examiners 
were trained to administer the CELF in one three-hour training 

session, which was conducted by a certified speech clinician. 

/ - 

Schaeffer's Teacher Rating Scale (STRS: Schaeffer, n.d.) lists 

/ 

23 traits and their polar opposites that describe students 1 
personality (e.g., nervous/relaxed), sociability (e.g., shy/outgoing), 
and work habits (e.g.* easily distracted, concentrates). Each bi- 
polar^ dimension is rated along a 7-point scale where 1 and 7 signify 
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most negative and positive responses, respectively. Each examiner 
completed the STRS for her two familiar and two unfamiliar examinees 
during a debriefing session that was conducted one week following the 
last administrations of the CELF. Internal consistency alpha for the 
STRS was .97. (The STRS is provided in the Appendix.) 

R esults 

Means ' and standard deviations on each scale of the CELF are 
di-sDlayed in Table 2, for each examinee group in both familiar and 
unfamiliar testing conditions. 



Insert Table 2 about here 



The PS scale on the CELF is comprised of six subtests; the PC 
scale, four. In order to compare children's performance on the two 
scales, weighted scores were calculated: the PS scale score was 
divided by six and the PC scale score by four. Then, a two-between 
(Handicapped vs. Nonhandicapped and Preschool vs. School age), two- 
within (Familiar vs. Unfamiliar tester and the CELF scales, PS vs. PC) 
analysis of variance (ANOVA) was run on the CELF. This ANOVA revealed 
significant £ values for handicapped status, £(1,60) = 16.98, p < 
.001, age status, £(1,60) = 81.18, £ < .001, for the handicapped by 
age interaction, £(1,60) = 11.98, £ < .001, for the CELF scale by age 
interaction,' £(1,60) = 7.90, £ < .01, and for. the familiarity 
condition by handicapped status interaction, £(1,60) = 4.00, £ < .05. 
Weighted mean scores and standard deviations on the two CELF scales 
for each of the four examinee groups in both unfamiliar and familiar 
testing conditions are shown in Table 3. 
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Insert Table 3 about here 



Inspection of Tables 2 and 3 reveals that nonhandicapped students 
performed better on the CELF than did the handicapped children (mean 
difference = 37.28 unweighted points or 7.24 weighted points), and 
that school -aged youngsters scored higher than preschoolers (mean 
difference = 75.28 unweighted points or 15.84 weighted points). The 
interaction between the familiarity condition and handicapped status 
is illustrated in Figure 1 for the weighted CELF scores and suggests 
that whereas nonhandicapped children scored similiarly when tested by 
familiar and unfamiliar examiners (scoring, on the average, 0.05 
points, lower on each subtest with the familiar tester), handicapped 
youngsters scored higher. when tested by the familiar examiner (scoring 
an average 1.86 points better with the familiar tester on each 
subtest). 



Insert Figure 1 about here 

Additional \ significant interactions indicated that (a) the 
difference between preschool and school -age children 1 s performances 

was greater for nonhandicapped than for handicapped youngsters, and 

/ 

(b) that the difference between performances on the PS and PC scales 
of the CELF was greater for preschool than /or school-age children. 
However, because these two findings are extraneous to the purpose of 
this paper, they are not discussed further. 
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For each Schaeffer Rating Scale protocol, the total score was 
divided by the number of items on the scale to derive an average score 
per item. On these average scores per item, a two-between 
(Handicapped vs. Nonhandicapped and Preschool vs. School-age), one- 
within (Familiar vs. Unfamiliar) ANOVA was run. The ANOVA yielded one 
significant F value for age status, F(l,50) = 4.26, £ < ,05; one 
additional F value approached significance for handicapped status, 
F(l,60) = 3.36, £ = .07. No significant effect was obtained for 
familiarity status, f_(l,60) = .03, its. Means and standard deviations 
on the Schaeffer, for each of the four examinee groups in both 
unfamiliar and familiar testing conditions are shown in Table 4. 
These means indicate that school-age students' ratings on the 
Schaeffer were an average .49 point higher per item than were the 
ratings of preschoolers, and that nonhandicapped youngsters were rated 
an average .44 point higher per item than were the handicapped 
students. Testers' ratings of familiar and unfamiliar subjects were 
nearly identical . 



Insert Table 4 about here 



Discussion 

Findings indicate that, in comparison to nonhandicapped children, 
handicapped subjects performed significantly stronger when tested by 
an examiner with whom they/were personally familiar. This result 
extends previous research on examiner familiarity in two important 
ways. First, whereas prior investigations, demonstrated that tester 

17 



13 

unfamiliarity discourages handicapped preschoolers' optimal, absolute 
test performance, results from this study show that the strangeness of 
a tester negatively influences handicapped children's performance 
relative to a normative population; it appears to depress selectively 
handicapped children's /test performance. This finding suggests that 
an examiner's unfamiliarity constitutes a negatively, systematically 
biasing condition and threatens the validity of handicapped students' 
test performance. - 11 

Results from the present investigation also extend, the importance 
of examiner familiarity from preschool to school-age handicapped 
children. The absence of an interaction between familiarity and CA or 
among familiarity, handicapping condition, and CA suggests there may 
be something about a child 1 s handicap, irrespective of CA, that 
promotes differential test performance. Such a causitive influence 
may originate within the child (e.g., self-perceived vulnerability); 
however, it also may be rooted in (a) examiners 1 attitudes and 
behaviors directed toward handicapped children (see Fuchs et al., 
1983; Fuchs, Zern, & Fuchs, in press) or (b) an interaction between 
the perceptions and actions of examiner and handicapped examinees. 

In any case, the apparently negatively biasing nature of examiner 
unfamiliarity for handicapped children has many implications for 
educational practice. One such implication involves the level of 
confidence we may place in the predictive validity of a test with 
respect to students 1 classroom performance. Despite the fact that 
most testers are unfamiliar and classroom teachers are familiar to 
pupils, predictive validity presupposes a high degree of similarity 
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between the conditions under which both testing and classroom learning 
occurs. While this presupposition may hold true for nonhandicapped 
children, present findings indicate it is not true for handicapped 
students; the strangeness of an examiner appears to transform the test 
situation into a qualitatively different experience from that of a 
classroom setting. Hence, what is learned about a handicapped child 
in the typical testing environment may have scant relevance for and 
may be seriously misleading about classroom instruction. 

The salience of a strange tester for handicapped, but not for 
nonhandicapped children, also questions the adequacy \of many tests 1 
standardization populations, when such groups are employed to 

determine handicapped pupils 1 test performance. Most psychological, 

•* / 

language, and educational achievement' tests include few if any 
handicapped children in their normative populations. Regardless of 
whether these largely nonhandicapped groups were tested by familiar or 
unfamiliar examiners during the standardization phase, we have no 
reason to believe their optimal performance was compromised. In 
contrast, examiner unf amiliarity has been seen to discourage 
handicapped children's optimal functioning. Thus, when we employ an 
unfamiliar examiner to assess handicapped pupils and when we compare 
these children's performance to a typical standardization population, 
we are comparing handicapped* children's suboptimal functioning to the 
presumably optimal * performance of the normative ^group. 

This seemingly "apples and oranges' 1 situation is an explicit 
violation of PL 94-142 and Section X 5p4, which stipulate that, for 
handicapped students, test instruments must be selected and 

19 
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administered to "accurately < reflect the student 1 s aptitude or 
achievement level or whatever other factors the test, purports to 

measure " (34 C.F.R., §§ 104.35 £bj £33 and 300.532 £c] £33, cited 

in McCarthy, 1983). 

This investigation also explored examiners 1 ratings of subjects 1 
test behavior on the STRS. Testers described school-age children ! s 
behavior as more positive than preschoolers' behavior, and examiners' 
higher ratings of nonhandicapped than handicapped examinees* behavior 
approached significance. These results paralleled findings that 
school -age and nonhandicapped subjects performed more strongly on the 
CELF than preschool and handicapped children, respectively. Together 
these findings seem to corroborate prior research demonstrating a 
positive relation between children's performance on tests and their 
adaptive, general behavior in the test setting (e.g., Sigel, 1975). 
In contrast, however, examiners rated familiar and , unfamiliar 
handi capped chi 1 dren 1 s test behaviors si mi 1 arly, despite the better 
performance of handicapped children on the CELF in the familiar 
condition. A previous study (Fuchs et al., 1983), also employing 

speech clinicians as testers but using a different rating instrument, 

v; 

generated similar results. 

It is unclear why testers' ratings of handicapped examinees on 
the STRS failed to reflect these subjects 1 stronger CELF' performance 
in the familiar condition. Whatever the reason may be for this 
seeming insensiti vi ty, there is an important implication: Although 
many tests depend on examiners to exercise their judgment to determine 
whether, and if so when, optimal performance has been established, 



20 



16 

results from this study and prior research suggest testers have 
difficulty making this determination, at least with respect to select 
populations. Further research might explore the utility of more 
objective procedures, such as a checklist or rating scale that 
specifies dimensions of rapport, with which testers may determine more 
accurately a child's valid test performance and his/her level of 
comfort during an examination. 
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Table 1 

CA and Experience of Examiners Who Were Assigned to Handicapped, 
Ndnhandicapped, Preschool, and School-age Subjects 3 



ANOVA 

Handicapped Nonhandl capped Preschool School -age Factor 



rlence 



inence 



Factor 




df 


F-value 


£-value • 


H/NH b 




1,28 


1.81 


.ir--^ 


P/S c 




1,28. 


. .00 


1.00 


H/NH x 


P/S d 


1,28 . 


.16 


.70 


\ 










H/NH . 




1,28 


.14 . 


.71 


P/S 




1,28 


.02 


.88 


H/NH x 


P/S 


1,28 


1.71 


,20 


N/NH 




1,28 


.21 


.65 


P/S 




1,28 


.69 


.20 


N/NH x 


P/S 


1,28 


• .03 


.86 



:r1es are means and standard deviations In parentheses, which are calculated In terms of months, 
idlcap vs. nonhandicap 
tschool vs. school-age 

fraction between handlcdp/Qojtbandlcap and preschool/school-age' 




\ 
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Table 2 

Unweighted Means and Standard Deviations on the CELF Scales for Each Examinee 
Group in Familiar and Unfamiliar Testing Conditions 9 



CELF Scale 



PS PC 



Examinee Group 


Familiar 


Unfamiliar 


Famil iar 


Unfamil iar 


Preschool Handicapped 


96.31 


(36.62) 


83.38 (38.86) 


36.31 (33.57) 


35.75 


(37.22) 


School-age Handicapped 


139.94 


(39.52) 


137.81 (43.26) 


90.94 (44.52) 


75.25 


(34.75) 


Preschool Nonhandi capped 


101 .56 


(45.64) 


101.31 (40.15) 


45.13 (28.42) 


40.69 


(29.23) 


School-age Nonhandicapped 


211 .63 


(32.38) 


216.13 (30.39) 


137.81 (51.40) 


138.25 


(52.71) 



a Standard deviations are indicated in parentheses. 
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Table 3 

Weighted Means and Standard Deviations on the CELF Scales for Each 
Examinee Group in Familiar and Unfamiliar Testing Conditions 9 







CELF 


Scale 








PS 


PC 




Examinee Group 


Famil iar 


Unfamiliar 


Fami liar 


Unfamil iar 


'reschool Handicapped 
>chool-age Handicapped 
'reschool Nonhandicapped 
School -age Nonhandicapped 


16.09 (6.08) 
23.33 (6.59) 
16.92 (7.61) 
35.28 (5.40) 


13.90 (6.47) 
22.14 (7.23) 
15.84 (5.13) 
36.03 (5.08) 


9.10 (8.40) 
22.76 (1 1 .1 3) 

9.74 (6.10) 
34.48 (12.85) 


8.96 (9.31) 
18.83 (8.67) 
10.19 (7.32) 
34.58 (13.19) 



^Standard deviations are indicated in parentheses. 



■r %± 
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Table 4 

Average Score Per Item on the Schaeffer for Each Examinee 
Group in Familiar and Unfamiliar Testing Conditions 



Score 





Famil iar 




Unfamil iar 




Examinee Group 


Mean 




SD 


Mean 




SD 


Preschool Handicapped 


4.41 


0 


.94 


4.27 


1 


.01 


School -age Handicapped 


.4.56 


0 


.71 


5.00 




.80 


Preschool Nonhandi capped 


4.87 


0 


.94 


- 4.58 


1 


.09 


School-age Nonhandicapped 


5.36 


0 


.82 


5.18 


0 


.95 



2i) 
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Figure 1. Display of interaction: CELF scores of handicapped 

(„-) and nonhandicapped ( ) children in familiar IF) 

and unfamiliar (U) testing conditions. 



30 

ERIC 



APPENDIX 
Schaeffer Teacher Rating Scale 



Child's Name: 



Compared with other children the same age, how would you describe this child 
on the traits listed here. For example, if the item had to do with height, and 
the child was just a little shorter than average, you might mark it as follows: 

Example: short 1 2 ® 4 5 6 7 tall 

Please circle a number on each line below to show where you would rate this 
child on the following traits: 



average 



1 . 


shy 


1, 2 


3 


4 


5 


6 


7 


outgoi ng 


2. 


slow learner 


1 2 


3 


4 


5 


6 


7 


fast learner 


3. 


nervous 


1 2 


3 


4 


5 


6 


7 


relaxed 


4. 


gloomy 


1 2 


3 


4 


5 


6 


7 


cheerful 


5. 


short attention span 


1 2 


3 


4 


5 


6 


7 


long attention span 


6. 


needs long explanation 


1 2 


3 


4 


5 


6 


7 


catches on quickly 


7'. 


often worried 


1 2 


3 


4 


5 


6 


7 


seldom worried 


8. 


stubborn 


1 2 


3 


4 


5 


6 


7 


cooperative 


9. 


unhappy 


1 2 


3 


4 


5 


6 


7 


happy 


10. 


easily distracted 


1 2 


3 


•4 


5 


6 


7 


concentrates 


11 . 


sel f-conscious 


1 ' 2 


3 


4 


5 


6 


7 


comfortable # with people 


12. 


doesn't know much 


1 2 


3 


4 


5 


6 


7 


knows a lot 


13. 


high strung 


1 2 


3 


4 


5 


6 


7' 


easy going 


14. 


quarrel some 


1 2 


3 


4 


5 


6 


7 


agreeabl e 


15. 


moody 


1 2 


3 


4 


5 


6 


7 


even-tempered 


16. 


gives up quickly 


1 2 


3 


4 


5 


6 


7 


sticks to it . 


17. 


fearful 


1 2 


3 


4 


5 


6 


7 


sel f-confident 


18. 


not too bright 


1 2 


3 


4 


5 


6 


7 


smart 


19. 


anxious 


1 2. 


3 


4 


5 


6 


7 


calm 


20. 


easily angered 


1 2 


3 


4 


5 


6 


7 


good-natured 


21 . 


complaining 


1 2 


3 


4 


5 


6 


7 


contented 


22. 


careless about work 


1 2 


3 


4 


5 


6 


7 


does careful work 
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